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Abstract 

> : 

OO , We propose an alternative way to represent graphs via OBDDs based on the observation 

that a partition of the graph nodes allows sharing among the employed OBDDs. In the 
second part of the paper we present a method to compute at the same time the quotient 
w.r.t. the maximum bisimulation and the OBDD representation of a given graph. The pro- 
posed computation is based on an OBDD-rewriting of the notion of Ackermann encoding 
of hereditarily finite sets into natural numbers. 



1 Introduction 

In this paper we consider the problem of computing and representing the bisimula- 
tion on a given Kripke structure. Such a problem, central in Model Checking, has 
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■ been tackled by many authors and various solutions have been given. In particular, 

the algorithm proposed by Kanellakis and Smolka IjKannellakis and Smolka 1990|l is 
used in many model checkers with explicit-state representation (e.g., XEVE), while 
the algorithm proposed by Bouali and de Simonc ( Bouali and de Simone 1992|l is 
used in the case of symbolic representation (e.g., NuSMV). The algorithms pre- 
sented in ( |Bouajjani et al. 1990| ) and IjLee and Yannakakis 1992| are designed to 
obtain better performance in the case of the so-called On-the-Fly Model Checking. 
The routine proposed by Paige and Tarjan in ( |Paige and Tarjan 1987| l is still the 
best in terms of worst case time complexity {0{\E\ log |-/V|)). In IjDovier et al. 200T)l 
it has been proposed an algorithm which works on explicit representations and 
which, in the worst-case, has the same time complexity as the one by Paige and 
Tarjan, but which in many cases reaches a linear time complexity 0(|i?|) and re- 
quires less space during the computation. 

A particularly interesting line of research is witnessed by the work presented in 
l|Bouali and de Simone 199'2| focussed on the design of algorithms whose execution 
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can be easily coordinated with efficient (symbolic) representation techniques for 
the underlying Kripke structures. The OBDD data structures (see ( [Bryant 1986| )) 
employed in Symbolic Model Checking (see IjMcMillan 1993|l ). allow the storage in 
memory of much larger structures and set new standards for the size of Kripke 
structures to which Model Checking can be applied. However, it is not always 
immediate to map the further space-saving method of bisimulation reduction on 
OBDD-representation of Kripke structures: especially designed algorithms must be 
proposed for this purpose and different complexity parameters must be considered 
IIKisler a,nd Va,rdi 1 999. .SoTnenzi 1999|l. 

In this paper we propose an alternative way to represent graphs via OBDDs which 
is based on the use of a (generic) partition on the nodes and which allows sharing 
among the employed OBDDs. Instead of using an OBDD to represent the graph's 
relation, after splitting the graph into blocks, for a given block Bj we use an OBDD 
whose size depends upon the list Dj of nodes reachable in one step from Bj only. 
If the graph has 2" nodes, then the unique OBDD of the standard representation 
has 2u levels, while with our technique we use OBDDs with u + kj levels, where 
kj — log \Dj\. Moreover, some sharing becomes possible: for example if all the Djs 
have approximately the same cardinality, then they share the last kj levels. 

When the initial partition is based on a suitable notion of rank we are able to 
propose a method for bisimulation computation as well as OBDD-representation, 
i.e. the output of the routine is the bisimulation quotient represented using our 
OBDD-encoding. The technique is based on the fact that a Kripke structure is 
nothing but a (in general redundant) representation of an hereditarily finite set. 
To this end, starting from an encoding a la Ackermann (see (jAckermann 19371 
[Levy 1979| )) of hereditarily finite sets as OBDDs, we propose an extension of such 
an encoding to non-well-founded hereditarily finite sets and a computation tech- 
nique exploiting the a priori bound on the size of the set to be encoded for its 
determination. The key notion behind our encoding is an extension of the notion 
of rank to non- well- founded sets, already used in IjDovier et al. 200T|l for explicit 
bisimulation computation. 

The paper is organized as follows: in Section[21we present the problem; in Section 
|31we review some related works; in Section 0| we consider techniques for OBDDs 
representation of graphs; in Section |S1 we propose the alternative representation, 
based on a layering of the graph, for which the basic operations are discussed in 
Sectional in Section[71we propose the method to compute the OBDD representation 
of the quotient of an acyclic graph w.r.t. the maximum bisimulation; in Section|Slwe 
complete our discussion extending the method to the cyclic case. Some concluding 
remarks, including a short discussion on the similarity between Ackermann encoding 
and OBDD representation, end the paper. 
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PART ONE 

Basics 

2 Preliminaries 

Definition 1 {Rooted Graph) 

A (direct) rooted graph^ is a triple G = {N,E,r), where N is the set of nodes, 
E C N X N is the set of edges, and r G iV is a node such that all the other nodes in 
N are reachable from r, i.e. for all a G there is a sequence ai, . . . , a^, of elements 
of N such that rEai, ah = a, and atEat+i for all 1 < i < /i — 1. 

From now on and for reasons that will become clear below, the relation E will be 
denoted by > (a stylized format for both 9 and —^) and its inverse relation E^^ 
will be denoted by <. We will use the term graph to refer also to rooted graphs. 
In the rest of this paper we will mostly deal with the following problem: 

Given a graph G = {N, >,r) determine a 'compact' representation of G/ =, 
where = is the maximum hisimulation over G. 

The problem is well-known in the area of Model Checking, where graphs have 

usually labels on nodes. We briefly recall the definition of bisimulation on graphs 
with and without labels and we justify the fact that in this paper we deal with 
graphs without labels. 

Definition 2 {Labelled Graph) 

Let E be a finite alphabet. A labeled graph is a triple G = {N, >,r,£), where 
{N, >,r) is a rooted graph and £ : A/' — > S is a labeling function. 

Definition 3 {Bisimulation) 

Given a labeled graph G = {N,>,r,i) a bisimulation is a relation B C N x N 
which satisfies: 

(label) a B a' ^ £{a) = £{a'); 
(forw.) aBa'Aa>b=^ 3b'{a' >b' AbB b'); 
(back.) aBa' Aa' >b' ^3b{a>bAbBb'). 
Given a graph G = {N,>,r) a bisimulation is a relation B C N x N which 
satisfies the conditions (forward) and (backward). 

We have given the definition of bisimulation B C N x N on a graph G = {N, >, r), 
but it is immediate to imagine the definition of a bisimulation B C Ni x N2 between 
two graphs Gi = (A^i, >i, ri) and G2 = {N2, >2, ^2): it is, essentially, only necessary 
to add the condition that r\Br2- 

The main theorem on maximal bisimulations states that: 

Theorem 4 

Given a graph G (with or without labels) there always exists a (unique) maximum 
bisimulation = which is an equivalence relation. 

^ In this paper we always refer to finite graphs. 
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Proof 

See (lAczel. 1988|l . □ 

We will say that two nodes a, a' of a graph are bisimilar if and only if a = a' . 

The quotient structure G/ = can be seen as the (most compact) graph represen- 
tation of the non- well- founded set (see IjAczel. 1988|l ) associated to the root of G 
and with > acting as the membership relation 9, (|Dovier et al. 20 07). 

Given a graph G' = {N', >' ,r' , £) it is possible to encode the labeling on the nodes 
by adding new nodes to the graph and obtaining a new graph G — {N, >, r) in such 
a way that there is a complete correspondence between the maximum bisimulation 
over G" and the maximum bisimulation over G. In some applications the graphs also 
have labels on the edges and the definition of bisimulation takes into consideration 
both the labels on the edges as well as the labels on the nodes. By adding new nodes 
it is possible to encode also the labeling of the edges. A possible (linear) encoding 
for the labels on the edges and on the nodes is described in IjDovier et al. 2001)1 : 
hence, it will not be restrictive to consider the case of graphs without labels. 

In this paper we propose a method to determine an OBDD representation of the 
quotient under the maximum bisimulation of a given graph G = {N, >,r). If one 
prefers not to move from a graph G to its unlabelled G" it is straightforward to 
extend the technique in order to cope directly with the labels. In the following we 
first give some references about related work, then we start with some reflections 
about symbolic representations and we propose an OBDD representation based on 
a partitioning of N which is at the core of our method to compute the maximum 
bisimulation. 

3 Related work 

The following material is related with both OBDDs and bisimulation. 

OBDDs, ordered binary decision diagrams, are a canonical representation for 
boolean functions, i.e. two boolean functions are equivalent if and only if they are 
associated to the same OBDD. General BDDs were first introduced in IjLee 1959|l 
and in ll Akers 1978|l . Bryant, defining in ( [Bryant 1985| ) the more restricted notion 
of OBDDs attracted attention to their use in logic design verification. 

OBDDs are used in Model Checking to represent the labelled graph which models 
the behavior of the system. Such a representation, usually called symbolic or implicit 
representation, allows to deal with systems with much more states than an explicit 
representation as noticed in IjBurch et al. 199211 . 

Unfortunately, it is possible, in the worst case, that the symbolic representa- 
tion of a system is as large as the explicit one, see fSom enzi 1999(1 . Many authors 
have tried to solve this problem using different techniques. The size of an OBDD 
depends on the ordering of the variables, hence methods to determine "a good" 
variable ordering, based on the use of heuristics which try to exploit the struc- 
ture of the system representation IjMalik et al. 1988)l and on dynamic reordering 
l|R,udell 1993,1 . have been proposed. However, there are many applications where 
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these optimization techniques for OBDDs reach their hmits. For a fixed state en- 
coding there are many finite state machines whose OBDD representations are large 
regardless of the variable orders, see IjAziz et al. 1994|l . When OBDDs are used to 
represent graphs the size of the OBDD depends also on the state encoding, and to 
this end in IjMeinel and Theobald 200T)l local encoding transformations have been 
studied. 

Another well-known technique to reduce the size of the OBDDs consists in parti- 
tioning the OBDD HBurch et al. 1991l|Memel and Stangier 2000||Meinel and Stangier 200l| ), 
in the sense that, instead of using a unique OBDD to represent the transition rela- 
tion of a graph, one OBDD for each variable of the target nodes is considered: the 
global OBDD is obtained from all these OBDDs. Such partitioning is substantially 
different from the partitioning we propose here: all the source nodes occur in all 
the OBDDs, while in our approach we partition the source nodes and each of them 
occurs only once. 

Further attempts to reduce the complexity of the OBDD-representation can be 
found in l|McMillan 1996IICabodi 2001|l where various form of decompositions based 
on the use of complicated functions to combine OBDDs are defined. 



As far as bisimulation is concerned it is difficult to accurately list all the fields in 
which, in one form or another, the notion of bisimulation was introduced and now 
plays a central role. Among the most important ones are: Modal Logic, Concurrency 
Theory, Set Theory, and Formal Verification. In Model Checking several existing 
verification tools make use of bisimulation in order to minimize the state spaces 
of systems description and to check equivalence between transition systems. The 
verification environment XEVE IjBouali 1998|l provides bisimulation tools which 
can be used for both minimization and equivalence test. In general, in the case of 
explicit-state representation, the underlying algorithm used are the ones proposed 
by Kanellakis and Smolka IjKannellakis and Smolka 1990|l and by Paige and Tarjan 
| |Paige and Tarjan 1987| ) , while Bouali and de Simone algorithm l|Bouali and de Simone "1992)1 
is used in the case of symbolic representation. All these algorithms are based on 
negative strategies: start by considering that all the nodes bisimilar and separate the 
nodes when it is possible to prove that they are not bisimilar. On the contrary, a pos- 
itive strategy would start by considering that all the nodes not bisimilar and put to- 
gether nodes when they have been proved to be bisimilar. In IjDovier et al. 200T|l an 
algorithm for explicit representation which combines positive and negative strate- 
gies exploiting the notion of rank as been proposed. A symbolic version of the 
result in l|Dovier et al. 200T)l has been proposed in IjDovier et al. 2002ll and led to 
the development of a linear symbolic algorithm for strongly connected components 
computation (see l|Gentilini et al. 2fl08|l V The method we present here differs from 
all the previous ones because it is based on a fully positive strategy and it strongly 
exploits the notion of rank, together with the alternative symbolic representation 
we introduce in order to perform the bisimulation computation. 
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4 OBDDs representing graphs 

The way OBDDs are usually employed in Model Checking to represent the states' 
space N, sets of states S C N, and the transition relation >, is based on the 
following observations IjClarke et al. 1999jl : 

• we can safely assume that N — {0, 1}", i.e. each node is encoded as a binary 
number; 

• a set S C N is a. set of binary strings of length u, hence its characteristic 
function xs ■ {0, 1}" {0, 1}, where 

Xs(si, ■ • • = 1 ^ {si, . . . ,Su) e S 

is a boolean function, which can be represented using an OBDD; 

• > C N X N is a set of binary strings of length 2u and hence, again, its 
characteristic function 

X>{xi,...,Xu,yi,...,yi,) = 1 ^ {xi, . . . ,Xu) > {yi, ■ ■ ■ ,yu) 
is a binary function, which can be represented using an OBDD. 

In particular, in the OBDD representing > (without variable reordering) the first u 
levels (variables) represent the codes of the source nodes, while the second u levels 
(variables) represent the codes of the target nodes (see Figure P). 

* 

edge <n,m> ^' ; ^^ 

/ \ \ binary code of n 



/ ; \ bynaiy code of m 

/ 

1 

Fig. 1 

Example 5 

Consider the graph G in Figure\^ on the left. Using the binary variables x\ and xi 
for the code of the first node and the variables yi and y2 for the code of the second 
node we obtain that the characteristic function of this graph is 

i^xi A ^2 A ^yi A ^y2) V (xi A a;2 A ^yi A 2/2)V 

{xi A ^X2 A ^yi A ^2/2) V {xi A -1X2 A ^yi A 1/2) 

which is represented by the OBDD in Figure\^on the right. 

If we use this OBDD encoding to represent G/ = (without using variable reorder- 
ing or other minimization techniques) we always obtain that if \N/ = | = 2'""^, then 
the higher half of the OBDD is a complete tree with 2""^ — 1 nodes at the level 
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Fig. 2 



corresponding to yi (the first variable relative to the encoding of the target nodes, 
see Example 01. This is because two paths of length v which converge would repre- 
sent two or more different nodes whose sets of >-successors are the same. However, 
since different nodes in G/ = are not bisimilar, different nodes cannot have the 
same set of >-successors.^ Even in case G is not quotiented w.r.t. bisimulation, we 
obtain that (again, without using variable reordering) in the OBDD representing 
>, at level v there are at least \N/ = | — 1 different nodes. 

A second disadvantage of this (rather classical) way of using OBDDs to represent 
a graph, is given by the fact that "topological" repetitions in the structure of 
the graph G are not exploited in order to obtain further reductions. OBDDs are 
designed to reduce the dimensions of a graph "horizontally" : nodes of the OBDD at 
the same level are collapsed only when the sub-OBDDs rooted on them are equal. 
In general there is no way to take advantage of a situation as the one described in 
the following example. 

Example 6 

Consider the graph in Figure\^ on the left. The situation in the part A and B of 



the graph is similar, but this has no reduction effects on the corresponding OBDD 
(see Figure\^on the right) which at level 3 has \N\ — 1 nodes. 

^ If there is a node without outgoing edges, then its path reaches directly 0, this is the reason for 
which we have 2""^ — 1 instead of 2"~^ nodes at level v. 




Fig. 3 



8 



C. Piazza and A. Policriti 



A "vertical" reduction could be performed only if it were possible to "ignore" 
some level, which means do not care about the values of the variables on the levels 
that are ignored (see below). 

What we try to do in the next section is to propose an alternative way to use 
OBDDs in order to represent graphs in such a way to obtain: 

• a vertical reduction, i.e. we deal with OBDDs shorter than 2u, where u = 
\og\N\; 

• use topological similarities to obtain further horizontal reductions. 

In general, however, the global height of the OBDD remains 2u, which implies 
that, in the worst case, it is possible to have 0(2^") internal nodes in it. 

5 OBDDs representing graphs: layering and sharing 

As we observed, if we work on a graph G reduced by bisimulation, we can safely 
assume that no two different nodes have the same set of successors with respect 
to >. Hence, the graph can be represented as a collection of sets of > -successors 
(one for each node) . Notice that the so-called unique-table of most OBDD packages 
already performs this kind of optimization, see ( Somenzi 1999|l . 

Moreover, if we have a (generic) partition P on the graph, each block Bj (P-class) 
of the partition can be represented separately, allowing sharing among the OBDDs 
used in the representation. 

Below we will combine the above two ideas into an OBDDs representation min- 
imizing the size of the data structures involved. The technique will turn out to be 
suitable for use even while the quotient structure G/ = is determined starting from 
G (cf. Sections Hand Ell. 

The following definitions will be used: 

• let P = {Bi, . . . , Bp} be a partition of N; 

• for each block Bj G P consider the list Dj of the nodes reached from a node 
in Bf 

Dj = [b\3ac Bj{a>h% 

Let us assume the nodes in Dj are ordered using their binary codes and let 
\D,\^h,- 

• for each b G Dj let dj (6) be the binary representation of the position in which 
b occurs in D, {dj{b) is in {0, i}Liog'«.J+i); 

• for each a G Bj consider the boolean function >j{a) : {0, l}Li°g'ijJ+i {Q, 1} 
defined as 

>j{a){zi,...,z\ioghj\+i) = \/ X{d,{b)}{zi,---,zi\ogh,\+i), 
where xx is the characteristic function of X. 
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Remark 7 

It is clear that if we know P, all the Dj 's, and all the >j(a), we can infer >. 

Moreover, the reader can check that if P — {N}, the representation proposed 
collapses to the technique discussed in the previous section. For each node a we 
give the OBDD rooted at the end of the binary code of a. In the case P — {N} 
the connection between the representation presented in the previous section and the 
representation we propose is similar to the connection between the adjacency-matrix 
and the adjacency-list representations of a graph. 

The data structures that can be used to keep aU these information are the following: 

• the binary encodings of the nodes in N, where we assume that the first q = 
[logpj +1 (p is the number of blocks) digits represent the block Bj to which 
a belongs; 

• the following set of pairs: 

V, = {{b,d,{b))\beD,}, 

is a function associating to each element in Dj its position, whose character- 
istic function can be represented using an OBDD; 

• the boolean function >j{a) can be represented using an OBDD (to which a 
points). 

All the OBDDs can be kept in a unique table with no duplicate sub-OBDDs, i.e. it 
is possible that the OBDD of Vj and the OBDD of Vj' share some sub-OBDDs. In 
particular, whenever Vj and Vji have the same cardinality they share all the levels 
relative to the dj(&)'s. 

Using the above representation with a non-trivial partition P: the OBDDs of the 
functions >j (a) 's are shorter than u (vertical reduction) ; it is possible that a is not 
bisimilar to a', but >j{a) = >j/(a'), i.e. they point to the same OBDD (reduction 
due to topological repetitions). 

It immediately follows that the set of nodes which refer to the same OBDD can 
be itself represented using an OBDD of height u. 

Example 8 

Consider the graph in Figure\^ (see Example\^. Using the partition P — {A, B, C} 
we obtain. 

Dc^% L»B = [000,001,010] 1?^ = [001,011,100] 
>c(000) = 

>b(001) = (-Z1 A -Z2) V (zi A -Z2) >s(010) = (-zi A Z2) 
>a(011) = (-Z1 A -Z2) V (zi A -Z2) >a(100) = (-Z1 A Z2) 
Hence 001 and Oil share the same OBDD (and similarly OlO and 100 j, in particular 
3 nodes are sufficient. In Figure^we show the OBDDs relative to Va and Vb which 
can share entirely the last two levels. 

Notice we can still use all the optimization techniques that are provided to deal 
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Fig. 4 

with OBDDs and, moreover, we are not forced to keep all the OBDDs in central 
memory. 

If |7V| — 2", out is the maximum number of outgoing edges from a node, and we 
partition N in 2*^ classes with q ^ u (using the first q digits), then we obtain that: 

1. in each class there are 2"~* = k elements, with k « \N\\ 

2. each Dj has at most out * k elements; 

3. the OBDDs of the functions >j(a)'s have at most \\og{out * fc)J + 1 levels. 

If out « \N\, then we obtain that out * k can be considered as a constant w.r.t. 
|7V| = 2", the OBDDs of P/s have about u levels'^, and a large number of OBDDs 
of >j(a)'s are shared. 

The representation achieves good results, in particular, if there are few outgoing 
edges from each node or if there are topological repetitions. This happens, for 
instance, in the case of graphs obtained from programs to be analyzed when: a 
procedure is used more than once; different procedures perform symmetric actions; 
there are sequences of deterministic iterations. 

Example 9 

Consider a graph with 2" nodes composed by 2'' cliques of 2"^'^ nodes with u — q « 
u. Moreover the cliques are connected in a cycle as shown in Figure\^in the case u — 
5 and q = 3. This graph can be interpreted as the representation of a communication 
process in which all the agents belonging to the same clique are able to communicate 
and only two special agent in each clique are able to communicate directly with 
another clique. In this case, with the standard representation is necessary to use 
one OBDD with 2u levels. Using our representation it is sufficient to use: q OBDDs 
with u levels and one shared OBDD with u — g + 1 levels to represent the Di 's; 3 
OBDDs with u — q + 1 levels to represent the >j{a) 's; 3 OBDDs with u levels to 
represent which nodes share the same OBDD. Notice that the constant 3 is due to 
the fact that in each clique Cj there are three kind of nodes: the nodes which reach 
only the nodes in the clique, the node which reaches the clique Cj-i, and the node 
which reaches the clique Cj+i. Hence, using the standard representation, depending 
on the encoding of the nodes and on the variable ordering, it is possible that there 

^ Exactly u + [log{out * k)j + 1 levels. 
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are 2^" internal nodes in the OBDD, while using our representation there are at 
most 2i°s9+«-9+i + 3 * 2"-«+i + 3 * 2" < 2«+3 internal nodes. 

Another example in which it is immediate to verify that our representation in 
the worst case works better than the classical one, is the case of graphs that are 
downward n-ary trees. In this case, partitioning the nodes into blocks of cardinality 
k*n the Dj^s have cardinality k. 

6 Operations 

The operations on the representation we propose are similar to the ones on the clas- 
sical representation with OBDDs. In this section we briefly discuss how to compute 
the image and the counter-image of a set of states. 

Let 5 C A?' be a set of states. Assume that we want to compute >{S) = {b\3a G 
S{a > b)} (image computation). We have that 5 = Si U . . . U Sp, where each 5, is 
a subset of a block in P and we obtain that 

p 

XHsM = \/3z[{\J >i{a){z))AxT>Mz)]. 

i=l aGSi 

Hence, in order to iteratively apply this operation (reachability computation) it is 
necessary to intersect X>{S){y) with all the blocks of P. In general, given a set S 
of states its intersection 5, with the block Bi of P is represented by the OBDD of 
the function 

XSiix) = Xs{x) AxBiix). 

If we assume that P's blocks are defined using the first q digits of the codes of the 
nodes, then the elements of Si are the nodes in S whose code start with the binary 
representation of i. 
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In order to compute <{S) = {b \ 3a E S{b > a)} (pre-image computation), we 
have to compute 

9b{z) 3y{xs{y) ^XVB{y,z)) 

X«(S)(S) = V y iM9B{z)A>B{x)im 
BeP seB 

Using similar boolean functions it is possible to move from the representation we 
propose to the classical one and vice-versa. 

In the next section we concentrate on a computation of the previously proposed 
representation for a symbolic version of G/ =, obtained starting from a represen- 
tation of G. 

PART TWO 

An OBDD representation oi G/ = 

7 Acyclic case 

In order to obtain the OBDDs representation while computing G / =, we will use an 
idea that can be traced back to the so-called Ackermann encoding of hereditarily 
finite sets into natural numbers IjAckermann 19371 [Levy 1979| ). Such an encoding, 
inductively defined as 

A{a) = Sbe,2^W, 

establishes a bijection between the collections of hereditarily finite sets and the 
natural numbers. Hence, two sets (possibly represented by different means) will 
be bisimilar if and only if they are mapped by A into the same natural number. 
Therefore, the computation of A and the bisimulation relation arc naturally carried 
on together. In our approach we essentially redefine A using OBDDs in place of 
natural numbers and layering the definition on the ranks in order to take advantage 
of repetitions as illustrated in Section [S] Given a node a we will call this OBDD 
rank-based Ackermann encoding A(a) 

We start from the acyclic case because in this case the notion of rank partitions 
the graph G and gives an order between the classes of such a partition which allows 
to compute the OBDD representation of G/ =. 

Definition 10 {Rank - acyclic case) 

Given an acyclic graph G = (A, >, r), the rank of a node a G A is defined as: 

rankia) = i ^ ifV6e A^(a>5) 

[_ max{rank (b) \ a > b} + 1 otherwise 

The following lemma states the main property (for our application) of the rank. 
Lemma 11 

Let G be an acyclic graph and a,a' € N be two nodes of G. 

a = a' =^ rank (a) = rank (a'). 



Ackermann Encoding, Bisimulations, and OBDDs 



13 



Proof 

See (|Dovier et al. 2001|l . □ 

Our OBDD encoding of G/ = is defined rank by rank as follows: 

1. at each rank i we determine which are the nodes reachable from all nodes at 
rank i, i.e. the list Di\ 

2. then, we compute for each node a at rank i the OBDD representing >i{a) 
(two nodes a and a' at rank i are bisimilar if and only if >i(a) = >i(a')); 

3. subsequently we collect all the OBDDs obtained at rank i in the list Cadi 
(without repetitions) and we assign to each node a at rank i the number A(a) 
which is the position of >i{a) in Cadi] 

4. for each node a at rank i its binary encoding is replaced with («, A(a)). 

Hence, two nodes a and a' at rank i are bisimilar if and only if A(a) = A(a') 
and, in general, two nodes a and a' are bisimilar if and only if {rank (a) , A{a)) — 
{rank{a'),A(a')). For this reason we introduce the encoding pair (corresponding 
to the classical Ackermann encoding) of a node a as the pair {rank (a), A(a)). 

Definition 12 {OBDD encoding - acyclic case) 

Let G = {N, >,r) be an acyclic graph. For each rank i, and each node a of G we 
define, by induction on the rank, Di, Codi, >i(a), and A(a) as follows: 

Di ~ [{rank (&), A(6)) | 3a{rank {a) = i A a > b)] 

>i(a) = Va>hX{d.(6)} 

Codi = [>i{a) I rank (a) — i] 

A(a) — k if and only if Codi[k] ~ >i(a) 

where Di and Codi are ordered lists without repetitions, >i{a) is an OBDD, and 
A(a) is a natural number. 

Theorem 13 

Let G — {N, >,r) be an acyclic graph and a,a' E N be two nodes. 

{rank{a),A{a)) — {rank{a'),A{a')) a = a'. 

Proof 

(=^>) Consider the relation B defined as aBa' if and only if {rank {a), A{a)) — 
{rank (a'), A(a')). We prove that i? is a bisimulation. If rank (a) = rank (a') = 0, 
then nBn' satisfies the forward and the backward condition, since a and a' have 
no successors. If rank (a) = rank{n') = i and A(a) = A(a'), then >i{a) = >i{a'), 
hence for each b such that a > b and {rank {b) , A{b)) is the j*'' element of Donii 
there exists b' such that a' > b' and {rank {b'),A{b')) is the j*'* element of Domi, 
hence we have that bBb' . Similarly we can prove that for each b' such that a' > b' 
there exists 6 such that a>b and bBb', hence we have that aBa' satisfies the forward 
and the backward conditions. From the fact that _B is a bisimulation we have that 
it is included in =, i.e. the thesis. 

(<^=) By induction on the rank. If rank (a) = and a = a' , then from Lemma ITTl 
we have that rank (a') = 0, hence A(a) = A(a') — 0. If rank (a) = i and a = a', 



14 



C. Piazza and A. Policriti 



then from Lemma 1111 we have that rank (a') = i, hence we have to prove that 
>i{a) = >i{a'). If X{di(b)} is a disjunct in >i{a), then there exists b such that a>b 
and {rank (6), A(&)) is the di{hy^ element oiDomi. Since a = a', there exists b' such 
that a' > b' and b = b' . By inductive hypothesis, since rank (6) < z, we obtain that 
{rank{b),A{b)) — {rank {b'), A{b')) , hence X{di{b)} is a disjunct in >,;(a'). Similarly 
we can prove that if X{di(b')} is a disjunct in >i(a'), then it is also a disjunct in 
>i{a), from which >i{a) = >i{a') and hence, A(a) = A(a'). □ 

Example 14 

Consider the graph G in Figure\^ 

rank 3 

/\ 

7""\ 

a6 ^ aV rank 2 




Fig. 6 

At rank we obtain: 

Da ^9 

>o(ai) = >o(a2) = >(a3) = -L 
A(ai) = A(a2) = A(a3) ^ 

rarzfc 1; 

Di = [{0,0)] Pi = {(0,0,0)} 

>i(a4) = >i(a5) = -^zi A{ai) = A(a5) = 

At rank 2: 

I?2 = [(0,0),(l,0)] P2 = {(0,0,0), (1,0,1)} 

>2(a6) = V zi, >2(a7) = zi A(a6) = 0, A(a7) = 1 

At rank 3; 

i?3 = [(2,0),(2,1)] P3 = {(2,0,0),(2,1,1)} 
>3(a8) = -'21 V zi A(a8)=0 

Let us describe the steps of our algorithm in the acyclic case. 

Algorithm 1 {Acyclic case) 

1. for a ^ N do compute rank (a); — compute the rank 

2. p :— Taa,x{rank (a) \ a £ N}; 

3. for i = 0, p do Bi :— {a £ N \ rank (a) = i}; 

4. for i = 0, . . . , p do 

(a) Di :— [{rank (6), A(6)) | 3a{a G Bi /\ a > b)]; — determine Di 

(b) for {rank{b),A{b)) e A do 

di{b) = position of {rank (b), A{b)) in Di; 
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(c) for a E Bi do 

>i (a) = X{di{b) I a>&}; — compute the OBDD for a 

(d) Cadi — [>i{a) \ a G Bi]] — collect the OBDDs at rank i 

(e) for a E Bi do 

A(a) — position of >i{a) in Cadi] — compute the encoding 
for a 

Notice that in the algorithm >i{a) is computed as the OBDD of the function 
X{di{b) |a>6}- This is equivalent to consider the OBDD of Va>b ^{<ii(fc)}- order 
to compute the latter we need a procedure which from the OBDDs representing / 
and g computes the OBDD representing f W g, while the first does not require the 
use of such a procedure. 

Theorem 15 

Let G — {N, >, r) be an acyclic graph. Let symbsize{>i{a)) be the number of nodes 
in the OBDD representing >i(a) and symbsize{>) — '^^^j^ symbsize{>i{a)). Al- 
gorithm correctly computes the OBDD encoding of G/ = w.r.t. Definition 1121 
with a worst case time 0(|iVp). The worst case space complexity is 0(|A^1 + [ > | + 
symbsize{>)). 

Proof 

The correctness of the algorithm follows from Theorem II 31 

Step (1.) can be performed in time 0{\N\ + \ > |) using a visit of the graph (see 
UDovier et al. 200T|l '). Step (2.) can be performed in time p, which in the worst case 
is |A''| — 1. Step (3.) can be performed in time 0{\N\). Globally all the lists Di for 
i — 0, . . . , p have a length of | > |, since each edge is used to put an element in one 
of the lists exactly once (i.e. when the rank of its starting node is reached). In order 
to avoid to add an element twice in a list Di it is sufficient to keep a global list 
of the {rank (b) , A{b)) with a flag specifying whether during the ith iteration the 
element has already been taken or not. Hence, globally all the steps (4. a) take time 
0{\ > I). Similarly all the steps (4.b) take time 0{\ > \). The computation of >i(a) 
takes time 0(2'°^'''), where hi — \Hi\ (see (IClarke et al. 1999 )'). In the worst case 
all the hi are |iV|, hence in the worst case the computation of >i{a) costs 0{\N\). 
So globally all the steps (4.c) cost 0(|7V|2). If we keep ah the OBDDs m a unique 
table, we obtain that globally all the steps (4.d) and (4.e) cost 0{\N\). Hence, since 
I > I < l-^Pj the worst case time complexity is 0(|A^|^). 

The worst case space complexity result is trivial. □ 

Notice that the encoding wc obtain in the acyclic case is an encoding a la Ack- 
ermann (see H Ackermann 1937)1 with two extra ingredients: 

1) since we are encoding only a particular graph, and not all the possible acyclic 
graphs as in the general case, we keep the encoding more compact (it is exactly 
to this end that we use the notion of rank); 

2) we use OBDDs, instead of an iterated exponential function, to compute the 
encoding. The use of an iterated exponential function would be very inefficient 
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because of its fast growing. Moreover, we do not need it, since, as explained in the 
previous point, we use are encoding only one graph. The connections between the 
Ackermann encoding and our encoding are further discussed in IjPiazza 2002|l . 



8 Cyclic case 

In order to generalize our encoding to the cyclic case, first of all we need a gener- 
alization of the notion of rank. We give here the definition of such a generalization 
which has been introduced in IjDovier et al. 200T)l . 

Given a graph G = {N, >,r), let G"'"' = {N""", c(r)) be the graph of the 
strongly connected components, where c(r) is the strongly connected component of 
r. Given a node a G N, we refer to the node of G"*^'^ associated to the strongly 
connected component of a as c(a). Observe that G""^'^ is acyclic and if G is acyclic 
then G'"^'' is G itself. 

We need to distinguish between the well-founded part and the non-well-founded 
part of a graph G. 

Definition 16 ( Well-founded part) 

Let G — {N, >, r) and a G N. G{a) = {N{a), >\N{a)y o.) is the subgraph of G of the 
nodes reachable from a. WF{G), the well-founded part of G, is WF{G) = {a E N : 
G[a) is acyclic}. 

The following is an extension of the previous notion of rank (cf. Definition 110(1 
suitable for dealing with the cyclic case. 

Definition 17 {Rank - general case) 

Let G = {N, >,r). The rank of a node a of G is defined as: 



rank (a) 



if a is a leaf in G 

— 1 if c(a) is a leaf in G"'^'^ and a is not a leaf in G 

max({l rank (b) : c{a) c(b), b £ WF{G)} U 

{rank (b) : c(a) c{b), b WF{G)}) otherwise 



Since G^'^'^ is always acyclic, the definition is correctly given. If G is acyclic then 
G = G^'^'^ and the above definition reduces to the one given in the acyclic case 
(Definitional). Notice that if the graph is strongly connected all the nodes have 
rank —1. This is never the case when our graph G is obtained from a labeled graph 
G'. 

Lemma 18 

Let G be a graph and a,a' € N be two nodes of G. 

a = a' rank (a) = rank (a'). 

Proof 

See UDovier et al. 2n01|l . □ 

Notice that the following also holds: 

a>b rank (b) < rank (a). 
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In particular, as pointed out in IjDovier et al. 2001|l . in order to determine if two 
nodes at rank i are bisimilar it is sufficient to know the bisimulation = on the nodes 
of rank less than i and the edges among the nodes at rank i. 

The main idea behind the extension of our way to compute the OBDD encoding 
in the cychc case is that of assigning to each node two encodings. This means that 
we treat a > 6 in an asymmetric way with respect to >: a "trick" first proposed 
by Fraenkel and Mostowski in their permutation of the universe technique (see 
Pech 1978|l ^ to model cyclic membership relations. In particular, the two encodings 
are >i{a) and d{a) and we compute >i{a) (Ihs of a > 6) using d{b) (rhs of a > 6). 

Let us assume that for each j < i we have correctly encoded all the nodes of rank 
j, i.e. we have assigned to each node a of rank j a number A(a) in such a way that 

a = a' <^ {rank (a), A{a)) ~ {rank {a'), A{a')) . 

We want to extend the encoding to the nodes at rank i. First we compute D^, all 
the >^(a)'s, God[, and all the A'(a)'s as in the acyclic case, but without considering 
the nodes at rank i. This means that >[{a) and A' (a) are a first approximation of 
>i{a) and A(a) in which we consider only the edges reaching a node b whose rank 
is less than i. In particular: 

D[ = [{rank (b), A{b)) \3a{rank (a) — i A a > b A rank (b) < i)], 

a^bAb£D'. 

Cod[ = [>[{a) I rank [a] = i], and 

A' (a) ^k if and only if Cod[[k] = >[{a). 

where D[ and Cod[ are lists without repetitions. 

For each node a at rank i consider a variable (which ranges over boolean 
functions) and a variable da (which ranges over natural numbers). If there are 
nodes at rank i, then we impose 

d^e{m + l,...,\D[\+r,}. 

We consider all the boolean equations 

Wa*'(^) = I V X{d.}(^-)) y>[{a){-z) (1) 

\a^h/\rank {b)—i J 

Notice that the second disjunct in the definition of is a boolean function, while 
we cannot explicitly write the first part until we know the values of the da's. Notice 
also that it is possible to replace the boolean equation with a numeric equation 
using the following definition 

j=\D[\^l \a^bl\rank{b)=i ^' " ^ / 
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with d'{a) = A'{a) + \D^\+ri + l. This means that if S* = {>i{a), di{a) \ rank (a) = i} 
is a solution of this system of boolean equations, then >i{a) is the OBDD associated 
to a and d{a) is the position of a in Di. Moreover, we consider all the boolean 
equations of the form 

(Wf'^wJ') ^ da = da' (2) 
which can be expressed as numeric equations as 

TOaa;(|da - da'|, 1) maxdwf ' - W^'|, 1) ' 

Let SySi be the system containing all the equations (1) and (2). We put as objective 
function to be minimized 

max{da I rank (a) = i}, 

i.e. we want to maximize the number of equalities between the da's. Let S = 
{>i{a),di{a) I rank (a) = i} be a solution of the system SySi which minimizes 
the objective function. We assign to each node a of rank i the code 

A(a) = d,(a)-(|I?:| + l). 

Notice that we always obtain that all the nodes a at rank —1 have A(a) — 0. This 
is correct, since we are working on unlabeled graphs. 

In order to show that our technique is correct we begin showing that the system 
has always at least one solution. We prove this by proving that from the maximal 
bisimulation we are able to describe a solution. 

Lemma 19 

Let G be a graph, and let = be the maximum bisimulation over G. Consider an 
ordering ord (starting from 0) of the equivalence classes of G/ = at rank i and for 
each node a at rank i define 

d,{a) = ord{[a]) + \D'^\ + 1 

= (Va>bAra«fe(b)=»X{d(fc)}) V >^(a). 

The set S = {>i(a), di{a) \ rank (a) = z} is a solution of SySi. 
Proof 

We recall that we are assuming that for all the nodes at rank less than i we have 
that 

a = a' {rank {a),A{a)) = {rank {a'),A{a')). 

By induction on the rank. 

Let i = —1. Since = is the maximum bisimulation all the nodes at rank —1 are 
bisimilar (see l|Aczel. 19881 IPiazza 20n2|l \ hence there is only one equivalence class 
and all the d{a) are equal to 1. We have >'_i{a) = ± for all the nodes at rank 
— 1. All the equations of the W^""s are satisfied, since we use them to define the 
>_i(a)'s in terms of the (i_i(a)'s. We have to prove that >_i(a) = >_i(a') ^ 
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d-i{a) — d-i{a'). So what we have to prove is that aU the >_i(a)'s are equal. If 
a is a node at rank —1, then there exists at least a node b such that a > b and 
rank (b) = —1. Hence, if rank (a) = —1 we obtain >_i(a) — X{i}j i-C- all the 
>_i(a)'s are equal. 

Let i = 0. Since = is the maximum bisimulation all the nodes at rank are 
bisimilar (see (|Aczel. 19881 IPiazza 2n02|l V hence there is only one equivalence class 
and all the c?o(a) are equal to 1. All the equations of the W^°'s are satisfied, since 
we use them to define the >o(a)'s in terms of the do(a)'s. We have to prove that 
>o(fl) = ^o(a') do{a) — do^a'). Hence, what we have to prove is that all the 
>o{a)'s are equal. If a is a node at rank 0, then it has no outgoing edges, therefore 
we obtain >o{a) = _L, i.e. all the >o(a)'s are equal. 

Let i = k + 1. All the equations of the Wl'^'s are satisfied, since we use them to 
define the >i(a)'s in terms of the di{a)'s. We have to prove that >i{a) = >i{a') ^ 
di{a) — di{a'). If >i{a) = >i{a'), then we have to prove that di{a) = di(a'), i.e. 
we have to prove that a = a'. If a > 6 and rank{b) < i, then in >i{a) there is 
a disjunct of the form X{di(b)} with di{b) < \D'^\. The same disjunct is in >i(a'), 
and hence we obtain that there exists b' such that b = b' and a' > b' . If a > 6 and 
rank (5) = i, then in >i(a) there is a disjunct of the form X{di{b)} with di{b) > \D^\, 
hence the same disjunct must be in >i(a'), i.e. (from the definition of di{b)) there 
exists b' such that a' > b' and b' = b. Similarly for the backward condition. Hence 
we have a = a', i.e. di{a) = di(a'). If di{a) — di{a'), then a = a' . We have to prove 
that >i(a) = >i{a'). If in >i{a) there is a component of the form X{di(b)} with 
di{b) < \D[\, then a>b and rank (b) < i, hence there exists b' such that a' > b' and 
b = 6', i.e. di{b) = di{b') from which we obtain that in >i{a') there is a component 
of the form X{di(b)}- If >i(a) there is a component of the form X{di{b)}j with 
di{b) > \Di\, then since a = a' , there must be b' such that a' > b', and b = b', from 
which we obtain that in >i(a') there is a component of the form X{di{b)}- Similarly, 
it is possible to prove that all the components that are in >i(a') are also in >,;(a), 
from which we obtain that >i{a) = >i{a'). □ 

On the ground of the above result we can show that with the proposed encoding 
we obtain a (unique and symbolic) representation of G / =. 

Theorem 20 

Let G = {N, >, r) be a graph and a,a' ^ N be two nodes. 

{rank{a),A{a)) = {rank{a'),A{a')) <^ a = a' . 

Proof 

(^) Let B be defined as aBa' if and only if {rank (a), A{a)) = {rank{a'),A{a')). 
We prove that i? is a bisimulation. If rank (a) = —1 and aBa', then we have 
nothing to prove since all the equivalence relations between nodes at rank —1 are 
bisimulations. Also the case in which rank (a) = is trivial. Let rank (a) — i and 
aBa'. If a>6 and rank (b) < i, then in >i{a) there is a disjunct of the form X{di{b)} 
with di{b) < \D'^\. The same component occurs in >i(a'), hence there must exists b' 
such that a' >b', rank{b') < i and {rank {b),A{b)) = {rank {b'), A{b')) , from which 
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we obtain that b'Bb. If a>b and rank (b) — i, then in >i(a) there is a component of 
the form X{di{b)} with di{b) > \D^\. Since >i{a') = >i{a) we obtain that in >i{a') 
there is a component of the form X{di(b)}- This imphes that there exists b' such that 
a' > b', rank (6') = i and di{b) = di{b') in the solution of the system. From the fact 
that di{b) = di{b') we obtain >i{b) — >i{b'), hence bBb'. The backward condition 
is similar. 

(<;=) From Lemma [T51 we have that the maximal bisimulation gives us a solution. 
From each solution we are able to define a bisimulation B such that 

max{di{a) \ rank (a) = i} + 1 

is the number of classes at rank i (see (=^)). Hence, it must be that the solution 
obtained from = minimizes max{di{a) \ rank (o) = i} at each rank. □ 

Notice that in the numeric system we use the exponential function 2^'' in order to 
compute >i{a). We could have used any other function / such that: 

• f{[b,b\R]) ~ f{[b\R]) (/ does not depend on the number of repetitions); 

• 62, • ■ • , bii]) = f{[bii, bi2, . . . , bih]), where on the r.h.s. we have a permu- 
tation of the l.h.s. (/ does not depend on the ordering); 

• f{Ll) ^ f{L2), if in LI there is an element which is not in L2 or viceversa. 

Example 21 

Let us assume that at rank i we have the sub-graph presented in Figure^ and that 
>i{a) = >i{c) — f{z) and >^(6) = >iid) = ff(^)- In this case a minimal solution 




Fig. 7 

of the system of boolean equation is given by 

d,{a) = d,{c) = \D[\ + 1, d,{b) = d,{d) ^\D[\+ 2, 
and hence we can assign 

A{a) = A(c) = Q,A{b) = A{d) = 1. 
Let us describe the steps of our algorithm in the cyclic case. 

Algorithm 2 {Cyclic case) 

1. for a N do compute rank (a); — compute the rank 

2. p :— max{rank (a) | a G N}; 

3. P:={B, |z = G,...,p}; 

4. for i = 0, . . . , p do 

(a) £>■ := [{rank{b),A{b)) | 3a{a G B, A a > b A rank (6) < i)]; 
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(b) for (ranfc(&),A(6)) e D[ do 

di{h) = position of {rank (6), A(6)} in £>■; 

(c) for a E Bi do 

^'i (^) = X{di{b) I a^bArank {b)<i}] 

(d) Codr ^ [>'^{a) \ a e B,]; 

(e) for a G Bi do 

A' (a) = position of >^(a) in Cod^; 

5. S'ysi = 0; 

6. for a G Bi do 

Sysi = Sys^ A Wf 1 = X{db I a^bAbeBi} V >-(a); 

7. for a, a' G do 

Sysi = S'ysi A (Wf ' ^ W^') ^ da = da'; 

8. S* = a solution of S'ySi minimizing maa:{da | a G S,} = {>i(a), (ii(a) | a G 
B^}; 

9. for a G Bi do 

Aia) ^ d^{a) ~ {\D[\ + 1); 

Theorem 22 

Let G = {N, >,r) be a graph. Let symbsize{>i{a)) be the number of nodes in the 
OBDD representing >i(a) and symbsize{>) — '^^^j^ symbsize{>i{a)). Algorithm 
Incorrectly computes the OBDD encoding of G/ = with a worst case time 0{\N\'^ + 
I > I log |A^|). The worst case space complexity is 0(|A^| + | > | + symbsize{>)). 

Proof 

The correctness of the algorithm follows from Theorem 1201 

Steps (l.)-(4.) have a worst case time complexity 0{\N\'^. As far as steps (5.)-(8.) 
are concerned, notice that we are only interested in the solution of Sysi. As it follows 
from Lemma lH^ the solution S can be determined using the maximum bisimulation. 
This can be found using Paige- Tarj an algorithm ( |Paige and Tarjan 1987| ) in time 
0{\ > I log \N\). From the maximum bisimulation we can build >i{a) in time 0{\N\). 
Hence, globally aU the steps (5.)-(8.) take time 0(| > | log |iV| + \N\'^). It is plain 
that all the steps (9.) take time 0{\N\). 

The worst case space complexity result is trivial. □ 

Notice that in order to obtain the solutions of all the SySiS in time 0(|>| log \N\ + 
\N\'^) we exploit Paige- Tarjan ( |Paige and Tarjan 1987| ) algorithm. This is possible 
since SySi is nothing but a boolean encoding of the bisimulation problem for the 
nodes at rank i. Having an algorithm which computes the maximum bisimulation 
in time T we can find the solutions of all the Sysi^s in time T + 0{\N\'^), where the 
0(|iVp) time complexity is due to the construction of all the OBDDs >i(a). 

9 Final Considerations 

Given a graph G our method allows to compute a symbolic representation of G/ =. 
Hence, the smaller is G/ = with respect to G the smaller is our representation with 
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respect to the one obtained by applying the standard method (briefly described 
in Sectional). To show this fact let us consider some extreme cases. Let G be a 
graph constituted by n self- loops. Since the n nodes are all bisimilar G/ = has 
only one node with one self- loop and our representation of G/ = requires only one 
OBDD with 2 internal nodes. The size of the standard OBDD representation of 
G grows with n. For instance, with n = 4 the OBDD has 9 internal nodes, while 
with n = 8 it has 19 internal nodes. Similarly, if wc consider a graph G of n nodes 
constituting a loop, we get that, since G/ = has only one node and one self-loop, 
our representation requires only one OBDD with 2 internal nodes. Again the size of 
the standard representation grows with n. For instance, if rt = 4, then the OBDD 
has 8 nodes, while with n = 8 the OBDD has 21 nodes. We obtain similar results 
also considering graphs constituted by chains of nodes. 

Other cases in which our representation uses less space than the standard one can 
be easily obtained by considering graphs in which the same structure is repeated 
at different ranks. In fact, in these cases we can reuse the same OBDDs at different 
ranks. 

By combining the two above observations, we can build classes of graphs in which 
the same structure is repeated at different ranks of the bisimulation quotient. Also 
in these cases we can reuse the same OBDDs at different ranks. 

Recently a joint CWI/INRIA project has been started to establish an "official" 
benchmark suite for large transition systems. A preliminary version of the bench- 
mark suite, called VLTS (Very Large Transition Systems) Benchmark Suite, is avail- 
able at http : //www . inrialpes . f r/vasy/cadp/resources/benchmarkjDCg.html. 
The benchmarks have been obtained from various case studies modelling commu- 
nication protocols and concurrent systems. Many of these case studies correspond 
to real life, industrial systems. Two peculiarities of the benchmarks (see also their 
graphical representations in the web page) suggest that we can strongly exploit the 
OBBDs reuse typical of our encoding. Such characteristics correspond to: 1) the 
presence of deadlocks and; 2) a low number of outgoing edges per nodes. From our 
point of view the presence of deadlocks ensures that there are nodes at different 
ranks, since all the deadlocks are nodes of rank 0, all the nodes which can reach a 
deadlock in one step are at rank 1, and so on. Moreover, a low number of outgoing 
edges from a node increases the probability of using the same OBDDs to represent 
nodes at different ranks (see Section |SJ. Experimentation is in progress. 

10 Conclusions 

In this paper we first proposed a way to represent graphs via OBDDs provided 
they are reduced with respect to the maximum bisimulation =. Our technique 
is based on the observation that, in a graph quotiented by =, each node can be 
uniquely characterized by the set of its successors. Moreover, the representation we 
propose can be computed on a graph layered with respect to an equivalence relation. 
This allows, in general, sharing among the employed OBDDs, thereby guaranteeing 
savings in space. 
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The second part of the paper shows how, when the layering is based on a suitable 
notion of rank, the representation can be used as an Ackermann encoding. Such an 
encoding allows a computation of the quotient structure together with its OBDDs 
representation. This second part begins with the treatment of acyclic graphs and 
then provides the representation in the general case as a solution of a system of 
equations. The encoding we propose, by exploiting the (a priori) knowledge on the 
size of the graph (set), greatly reduces the size of the involved numbers with respect 
to the standard Ackermann encoding. 

It should be noted that the Ackermann encoding is, in fact, very similar to the 
OBDD representation of a graph: both techniques are designed to provide a com- 
pact representation of a graph and are based on a binary representation. The main 
difference is the inability of the OBDD representation to fully exploit the space sav- 
ings induced by bisimulation, as the only sharing automatically introduced by the 
OBDD representation allows avoiding repetition of nodes having equal immediate 
successors. On the other hand, however, the Ackermann encoding was a map de- 
signed (for entirely different purposes) to embed the entire universe of well-founded 
hereditarily finite sets, hence the numbers involved tend to grow very fast and the 
cyclic sets could not be mapped. 

In this paper we have tried to retain the good aspects of both techniques intro- 
ducing a method integrating Ackermann idea of (uniquely) encoding sets by rank 
and representing the encoding by OBDD's instead of natural numbers. 

A similar approach for the computation of the quotient w.r.t. the maximum 
simulation has been considered in l|Gentilini et al. 20011 IPiazza 2 0021. The main 
difference in the case of simulation is that while considering the nodes at rank i 
it is also necessary to update the encodings of the nodes at rank less than i in 
an incremental way. The encoding presented in (Gentilini et al. 20011 IPiazza 2002|l 
differs from the one presented here because it encodes for each node a the set of 
nodes which are simulated by a. We are currently investigating the computation of 
the simulation quotient exploiting the encoding presented here. 
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